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The "time-asymmetry" attributed to the standard formulation of Quan- 
tum Mechanics (QM) was inherited from a reasonable tendency learned from 
Classical Mechanics (CM) to predict the future based on initial conditions: 
once the equations of motion are fixed in CM, then the initial and final condi- 
tions are not independent, only one can be fixed arbitrarily. In contrast, as a 
result of the uncertainty principle, the relationship between initial and final 
conditions within QM can be one-to-many: two "identical" particles with 
identical environments can subsequently exhibit different properties under 
identical measurements. These subsequent identical measurements provide 
fundamentally new information about the system which could not in prin- 
ciple be obtained from the initial conditions. Although this lack of causal 
relations seemed to conflict with basic tenets of science, many justified it 
by arguing "nature is capricious." This lead to Einstein's objection "God 
doesn't play dice." Nevertheless, after 100 years of experimental verification, 
QM has won over Einstein's objection. 

QM's "time-asymmetry" is the assumption that measurements only have 
consequences after they are performed, i.e. towards the future. Neverthe- 
less, a positive spin was placed on QM's non-trivial relationship between 
initial and final conditions by Aharonov, Bergmann and Lebowitz (ABL) [2] 
who showed that the new information obtained from measurements was also 
relevant for the past of every quantum-system and not just the future. 
This inspired ABL to re-formulate QM in terms of Pre-and-Post-Selected- 
ensembles. The traditional paradigm for ensembles is to simply prepare 
systems in a particular state and thereafter subject them to a variety of 
experiments. These are "pre-selected-only-ensembles." For pre-and-post- 
selected-ensembles, we add one more step, a subsequent measurement or 
post-selection. By collecting only a subset of the outcomes for this later 
measurement, we see that the "pre-selected-only-ensemble" can be divided 
into sub-ensembles according to the results of this subsequent "post-selection- 
measurement." Because pre-and-post-selected-ensembles are the most refined 
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quantum ensemble, they are of fundamental importance and subsequently led 
to the two-vector or Time- Symmetric re-formulation of Quantum Mechanics 
(TSQM) [HIE]- TSQM provides a complete description of a quantum-system 
at a given moment by using two-wavefunctions, one evolving from the past 
towards the future (the one utilized in the standard paradigm) and a second 
one, evolving from the future towards the past. 

While TSQM is a new conceptual point-of-view that has predicted novel, 
verified effects which seem impossible according to standard QM, TSQM is 
in fact a re-formulation of QM. Therefore, experiments cannot prove TSQM 
over QM (or vice- versa). The motivation to pursue such re-formulations, 
then, depends on their usefulness. The intention of this article is to an- 
swer this by discussing how TSQM fulfils several criterion which any re- 
formulation of QM should satisfy in order to be useful and interesting: 

• TSQM is consistent with all the predictions made by standard QM 

• TSQM has revealed new features and effects of QM that were missed 
before (§2}, 

• TSQM has lead to new mathematics, simplifications in calculations, 
and stimulated discoveries in other fields (as occurred, e.g., with the 
Feynman re-formulation of QM) §3j 

• TSQM suggests generalizations of QM that could not be easily articu- 
lated in the old language (§31 

A more conservative scientist may choose to utilize all the pragmatic, oper- 
ational advantages listed abovd3, but stick to the standard time-asymmetric 
QM formalism. Our view is that these new effects form a logical, consistent, 
and intuitive pattern (in contrast to the traditional interpretation). There- 
fore, we believe there are deeper reasons which underly TSQMs success in 
predicting them. One generalization suggested by TSQM ( §4.ip addresses 
the "artificial" separation in theoretical physics between the kinematic and 
dynamical descriptions [23]; another ( §4.21) provides a novel solution to the 

1 While this paper focuses on theoretical issues, we emphasize that many of the novel 
predictions have been tested in quantum-optics laboratories utilizing Townes' laser tech- 
nology [66]. In addition, TSQM has suggested a number of innovative new technologies 
which could be implemented with lasers ^3.11 
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measurement problem. Consequently, we are able to change the meaning of 
uncertainty from "capriciousness" to exactly what is needed in order that the 
future can be relevant for the present, without violating causality, thereby 
providing a new perspective to the question "Why does God play dice?" 
( §5.ip In other words, TSQM suggests that two "identical" particles are not 
really identical, but there is no way to find their differences based only on 
information coming from the past, one must also know the future. We also 
show how the second generalization involving "destiny" is consistent with 
free-will ( §5.21) . Finally, we speculate on the novel perspectives that TSQM 
can offer for several other themes of this volume, such as emergence. 

1 Consistency of Time- symmetric Quantum 
Mechanics with Standard Quantum Mechan- 
ics 

We first motivate TSQM with a paradox concerning the relativistic covari- 
ance of the state-description in QM. 

1.1 Motivation - A Relativistic Paradox 

Consider two experimentalists A and B corresponding to two spin- 1/2 parti- 
cles prepared in a superposition with correlated spins (a a + o"b =0), i.e. in 
an EPR state: 

\*EPn{t = 0)> = {||)a |I)b- |1)a |T)b} (1.1) 

Suppose at some later time t 2 , the particles separate to a distance L and 
A measures his spin in the z-direction and obtains the outcome ||z)a- Ac- 
cording to the usual interpretation, ideal-measurements on either particle will 
instantly reduce the state from a superposition \*$! epr^—z)) = \^epr(®)) = 
75{|T)a ||)b- I J.) a |T)b} into a direct product \^(t 2 + e) = |T*)a|J,»)b. I.e. 
after A performs his measurement at t — t 2 , then the joint wavefunction 
collapses so B's wavefunction also collapses to | | 2 )b which can be confirmed 
if B actually performs a measurement. When should B perform this mea- 
surement? Consider a "lab" frame-of-reference which is at rest relative to A 
and B, in which case the collapse is simultaneous (see figure [Da) as indicated 
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by the space-time coordinates: 




However, what is simultaneous in one frame-of-reference is not simultaneous 
in another: e.g. as we change to a rocket frame-of-reference which moves 
with velocity (3 = - in the x direction (with the same space-time origin), 
then the "plane-of-simultaneity" changes (see figure [TJb) as can be seen with 
the new coordinates after a Lorentz-transformation: 

a . ( 7 -fa \( ct 2 \ / jct 2 \ 
■ {-fa 1 { ) { -fact, ) 

R . 7 ct A= lch \ 

■ [-fa 7 J \ L ) \ -fact 2 + jL ) 

In the rocket frame-of-reference, the collapse of the wavefunction of B hap- 
pens at t\ = 7^2 — ^7-^ < t 2 , i.e. the rocket frame-of-reference notices at 
t\ < t 2l that B is in the state | |^)b, implying that the joint EPR wavefunc- 
tion had collapsed at t\ or before, so the state of A should be | | 2 )a no later 
than ti. If we transform back to our lab-frame-of-reference: 

- U?)(-k)-( i v 1 ) <-» 

we see that the particle on A's side was in the | j \ z )a state even before A 
made the measurement at t 2 (contradicting our notion that A's measurement 
supposedly caused the collapse in the first place.) 

In summary, this paradox focuses on "when did the collapse take place?" . 
In the lab-frame, A's measurement occurs first and then B's measurement 
occurs (see figure [TJa). However, in a rocket-frame, B's measurement occurs 
first and then A's (see figure [TJb). The lab-frame believes that A's measure- 
ment caused the collapse whereas the rocket-frame disagrees and believes that 
B's measurement caused the collapse. While the two different versions give 
the same statistical results at the level of probabilities, they differ completely 
on the state-description during the intermediate times and there is nothing 
in QM to suggest which version is the correct one. A similar arrangement 
has been probed experimentally producing results consistent with TSQMs 
hypothesis. [80] @ This paradox has two possible resolutions: 

2 This paradox can be sharpened in several ways [32] ■ 
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hypersurface intersects the B worldline before -B's measurement, b) the i roc fc e t 
hypersurface intersects the B worldline after -B's measurement. 








1. Collapse cannot be described covariantly in a relativistic theory at the 
level of the state-description, only at the level of probabilities. This 
thereby precludes progress on questions such as "Why God plays dice." 

2. As first pointed out by Bell [1EJIE1E]; Lorentz-covariance in the state- 
description can be preserved in a theory like TSQM [32] ( §2.41) . In 
addition, we believe it to be the most fruitful approach to probe deeper 
quantum realities beyond probabilities, thereby providing insight to 
questions like "Why God plays dice." 

1.2 The Main Idea 

TSQM contemplates measurements which occur at the present time t while 
the state is known both at t in < t (past) and at t nn > t (future). More pre- 
cisely, we start at t = t- in with a measurement of a nondegenerate operator 
O m . This yields as one potential outcome the state |^in), he. we prepared 
the "pre-selected" state l^in). At the later time t nn , we perform another 
measurement of a nondegenerate operator O nn which yields one possible out- 
come: the post-selected state |\l/ nn ). At an intermediate time t G [t m , tfmL 
we measure a non-degenerate observable A (for simplicity), with eigenvec- 
tors {|ctj)}. We wish to determine the conditional probability of Oj, given 
that we have both boundary conditions, |^i n ) and (\I/fl n | . |f| To answer this, 

3 Such an arrangement has long been considered in actual experiments: consider a bub- 
ble chamber scattering experiment. The incoming particle, l^in), interacts with a target 
and then evolves into various outgoing states, l^nn)]., | , 3 / nn)2, etc. Typically, photographs 
are not taken for every target-interaction, but only for certain ones that were triggered 
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we use the time displacement operator: Ut in ~>t = exp{—iH(t — tin)} where 
H is the Hamiltonian for the free system. For simplicity, we assume H is 
time independent and set H = 1. The standard theory of collapse states 
that the system collapses into an eigenstate \aj) after the measurement at t 
with an amplitude (a j\U tin ^t\^ in) ■ The amplitude for our series of events is 
ctj = (^ nn \U t ^t fin \ a j)( a j\Ut in ~,t\^m) which is illustrated in figure [2J a. This 
means that the conditional probability to measure a,j given l^in) is pre- 
selected and l^fin) will be post-selected is given by the ABL formula [2] 
0: 

fr[aj, t|W in , t in , W fin , t fin J - — 7- 1 i rf hT . mo 



As a first step toward understanding the underlying time-symmetry in the 
ABL formula, we consider the time-reverse of the numerator of eq. 11.61 and 
figure Ela. First we apply Ut-*^ on (^fin| instead of on (a,j\. We note 
that (\l/fi n |L^_» tfln = (Ut^ ttin ^nn\ by using the well-known QM symmetry 
uU tfin = {e-^*^-*)} 1 = e iH ^~^ = e -*^(*-*fl») = U t&a ^f We also apply 
Utfo-tt 011 ( a j\ instead of on \^f- m ) which yields the time-reverse re- formulation 
of the numerator of eq. 11.61 (U tfin ^t^ nn\aj) (Ut^,t in a>j\^ in) as depicted in fig. 
[2jb. Further work is needed to formulate what we mean by the 2-vectors 
in TSQM. E.g. if we are interested in the probability for possible outcomes 
of Oj at time t, we must consider both U tin ^ t \^m) and (^t fin ->t^fin| : since 
these expressions propagate the pre-and-post-selection to the present time t 
(see the conjunction of both figures [2Ja and[2Jb giving [2Jc which is re-drawn 
in figure Ob; these 2-vectors are not just the time-reverse of each other). 
This represents the basic idea behind the Time-Symmetric re-formulation of 



by subsequently interacting with detectors. In CM, there is (in principle) a one-to-one 
mapping between incoming states and outgoing states, whereas in QM, it is one-to-many. 
By selecting a single outcome for the post-selection-measurement, wc define the prc-and- 
post-selected-ensemble that has no classical analog. 

4 ABL is intuitive: | (a : ,-|[/ tin _> t l^in) | 2 is the probability to obtain \a,j) having started with 
l^in). If \aj) was obtained, then the system collapsed to \a,j) and K^nnlE^t-^tfiJttj)! 2 is then 
the probability to obtain j^gn). The probability to obtain \aj) and |^fi n ) then is |<x,| 2 . 
This is not yet the conditional probability since the post-selection may yield outcomes 
other than (^ nn \. The probability to obtain |* fin ) is £\ |o,-| 2 = | (*fi n |*in> | 2 < 1- The 
question being investigated concerning probabilities of aj at t assumes we are successful in 
obtaining the post-selection and therefore requires the denominator in eq. I1.6[ Y) j \oij\ , 
which is a re-normalization to obtain a proper probability. 
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Figure 2: Time-reversal symmetry in probability amplitudes. 
Quantum Mechanics (TSQMj: 
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While this mathematical manipulation clearly proves that TSQM is consis- 
tent with QM, it yields a very different interpretation. For example, the 
action of Ut Gn -tt 011 (^fin| (i-e. (U tfin ^t^&n\) can be interpreted to mean that 
the time displacement operator Ut Rn -tt sends (^fi n | back in time from the 
time tfl n to the present, t. A number of new categories of states (figure [3]) are 
suggested by the TSQM formalism and have proven useful in a wide variety 
of situations. 

In summary, the ABL formulation clarified a number of issues in QM. 
E.g.: in this formulation, both the probability and the amplitude are sym- 
metric under the exchange of l^m) and I^J/gn). Therefore, the possibility of 
wavefunction collapse in QM does not necessarily imply irreversibility of an 



5 We note that because (full) collapses take place at the t- m and ta n measurements, there 
is no meaning to information coming from t > ifi n or t < t- m . Therefore, at least in this 
context, there is no meaning to a "multi-vector" formalism. 
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arrow of time at the QM level. Nevertheless, the real litmus test of any 
re-formulation is whether conceptual shifts can teach us something funda- 
mentally new or suggest generalizations of QM, etc. The re-formulation to 
TSQM suggested a number of new experimentally observable effects, one im- 
portant example of which are weak-measurements (52]), which we now begin 
to motivate by considering strange pre-and-post-selection effects. 

1.2.1 Pre-and-post-selection and Spin-1/2 

One of the simplest, surprising, example of pre-and-post-selection is to pre- 
select a spin-1/2 system with l^in) = \a x = +1) = |ta) at time t in . After the 
pre-selection, spin measurements in the direction perpendicular to x yields 
complete uncertainty in the result [§, so if we post-select at time t^ n in the 
y-direction, we obtain l^fm) = \a y = +1) = | ] y ) half the time. Since the 
particle is free, the spin is conserved in time and thus for any t £ [ti n ,ifi n ], 

6 E.g. in the z-basis the state is -^(| ] z ) + | [ z )) which yields equal probability either 
spin-up or spin-down in the z-direction. 
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an ideal- measurement of either a x or a y , yields +1 for this pre-and-post- 
selection. This by itself, two non-commuting observables known with cer- 
tainty, is a most surprising property which no pre-selected-only-ensemble 
could possess. 

We now ask a slightly more complicated question about the spin in a 
direction £ = 45° relative to the x — y axis. This yields: 

a € = a x cos 45° + a y sin 45° = — + ° y (1.7) 

From the results Pr(a x = +1) = 1 and Pr(a y = +1) = 1, one might 
wonder why we couldn't insert both values, 6" x = +1 and er y = +1 into 
eq. 11.71 and obtain erg = = -\ = y/2. Such a result is incorrect for an 
ideal-measurement because the eigenvalues of any spin operator, including 

<j£, must be ±1. The inconsistency can also be seen by noting ( ax ^ v ) = 

<r x +<T v +(rxVy+<Ty(T X _ 1+1+0 _ ^ implementing the above argument, we 

would expect ( j x ^ v ^j = ( ^f ) =2^1. Performing this step of replacing 
a x = +1 and cr y = +1 in eq. II .71 can only be done if <r x and <7 y commute, which 
would allow both values simultaneously to be definite. Although it appears 
we have reached the end-of-the-line with this argument, nevertheless, it still 
seems that there should be some sense in which both Pr(a x = +1) = 1 and 
Pr(a y = +1) = 1 manifest themselves simultaneously to produce = 

1.2.2 Pre-and-post-selection and 3-Box-paradox 

Another example of a surprising pre-and-post-selection effect is the 3-box- 
paradox [3] which uses a single quantum particle that is placed in a su- 
perposition of 3 closed, separated boxes. The particle is pre-selected to 
be in the state |* in ) = l/v 7 ^ (\A) + \B) + |C)), where \A), \B) and |C) 
denote the particle localized in boxes A, B, or C, respectively. The par- 
ticle is post-selected to be in the state |Wg n ) = l/y/3 (\A) + \B) — \C)). 
If an ideal-measurement is performed on box A in the intermediate time 



7 This is also evident from ABL: the probability to obtain <j£ = +1 at the intermediate 
time if an ideal-measurement is performed is Pr{a^ = +1) = 1 + COb (^)+^^||)+™^) sln (S) _ 
We see that if £ = 0° (i.e. a x ) then the intermediate ideal-measurement will yield a x = +1 
with certainty and when £ = 90° (i.e. a y ), then the intermediate ideal-measurement will 
again yield a y = +1 with certainty. E.g. (7^=45 = ±1 is displayed in figure [9la 
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(e.g. we open the box), then the particle is found in box A with cer- 
tainty. This is confirmed by the ABL [2] probability for projection in A: 

- |(* fln |P,4|* in )| 2 



Pr(P, 



1. This can also be seen intu- 



l(ffln|PAl*m}| 2 + l(*fl n |Ps+Pdfi„}| 2 

itively by contradiction: suppose we do not find the particle in box \A). In 
that case, since we do not interact with box \B) or \C), we would have to 
conclude that the state that remains after we didn't find it in \A) is propor- 
tional to \B) + \C). But this is orthogonal to the post-selection (which we 
know will definitely be obtained). Because this is a contradiction, we con- 
clude that the particle must be found in box A. Similarly, the probability to 
find the particle in box B is 1, i.e. Pr (Pb = 1) = 1. The "paradox" is, what 
"sense" can these 2 definite statements be simultaneously true. We cannot 
detect the distinction with ideal-measurements: e.g. Pr(P a = 1) = 1 if only 
box A is opened, while Pr (Pb = 1) = 1 if only box B is opened. If ideal- 
measurements are performed on both box A and box B, then obviously the 
particle will not be found in both boxes, i.e. PaPb = 0. 
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Figure 4: a) pre-selected vector |^ in ) = l/y/3 (\A)+\B) 
in time from i; n to t\, and post-selected vector |\Pfin) 
propagating backwards in time from tg n to t^. b) ideal-measurement of P,4 at t\ 
and of at ti. 
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1.2.3 Counterfactuals 

There is a widespread tendency to "resolve" these paradoxes by pointing out 
that there is an element of counter-factual reasoning: the contradictions arise 
only because inferences are made that do not refer to actual experiments. 
Had the experiment actually been performed, then standard measurement 

8 The mystery is increased by the fact that both Pa and Pb commute with each other, so 
one may ask "how is it possible that measurement of one box can disturb the measurement 
of another?" 
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theory predicts that the system would have been disrupted so that no para- 
doxical implications arises. Suppose we applied this to the 3-box-paradox: 
the resolution then is that there is no meaning to say that the particle is in 
both boxes without actually measuring both boxes during the intermediate 
time. 

We have proven [TTJ [32j that one shouldn't be so quick in throwing away 
counter-factual reasoning; though indeed counter-factual statements have 
no observational meaning, such reasoning is actually a very good pointer 
towards interesting physical situations. Without invoking counter-factual 
reasoning, we have shown that the apparently paradoxical reality implied 
counter-factually has new, experimentally accessible consequences. These ob- 
servable consequences become evident in terms of weak measurements, which 
allow us to test - to some extent - assertions that have been otherwise re- 
garded as counter-factual. 

The main argument against counter-factual statements is that if we ac- 
tually perform ideal-measurements to test them, we disturb the system sig- 
nificantly, and such disturbed conditions hide the counter-factual situation, 
so no paradox arises. TSQM also provides some novel insights for this 
"disturbance-based-argument". E.g., for the spin- 1/2 case ( §1.2.11) . if we ver- 
ify <7 X at t = ti and a Y at t = t 2 , t- m < t\ < t 2 < tfi n , then Pr(a x = +1) = 1 
and Pr(a y = +1) = 1 are simultaneously true. But if we switch the order and 
perform a y before a x , then Pr[a x = +1) = 1 and Pr(a y = +1) = 1 are not 
simultaneously true, since measuring a y at time t — t\ would not allow the 
information from the earlier (t- m < t) pre-selection of cr x = +1 to propagate 
to the later time (£2 > ti > £in) of the <j x measurement. As a consequence, 
the cx x measurement at time t 2 would yield both outcomes a x = ±1@. So, in 
general, the finding that a x = +1 with certainty or a y = +1 with certainty 
in the pre-and-post-selected ensemble only held when one of these two mea- 
surements was performed in the intermediate time, not both. Therefore, we 
should not expect both a y = +1 and <j x = +1 when measured simultaneously 
through <j£=45°. 

For the spin-1/2 case, the ABL- assignment relied on only the pre-or-post- 
selection, while in the 3-box-paradox, the ABL assignment relies on both the 
pre-and-post-selection. However, ABL still only gives an answer for one 

9 The same argument applies in the reverse direction of time. The 4-outcomes are con- 
sistent with l^in) = 1 5a; = +1) and l^fln) = |<5y = +1)- Physically, the ideal-measurement 
of <j£ exposes the particle to a magnetic field with a strong gradient in the £ = 45° direction, 
which causes the spin to revolve around this axis in an uncertain fashion. 
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actual ideal-measurement. What happens if we tried to obtain two answers 
for the 3-box-paradox? In order to deduce Pa = 1, we used information 
from both pre-and-post-selected vectors. When we actually measured Pa, 
then this ideal-measurement will limit the "propagation" of the 2- vectors that 
were relied on to make this determination (see fig. Hlb). If we subsequently 
were to measure Pb, then the necessary information from both the pre- 
and-post-selected vectors is no longer available (i.e. information from t in 
cannot propagate beyond the ideal-measurement of Pa at time t\ due to the 
disturbance caused by the ideal- measurement of Pa)- Thus, even though Pa 
and Pb commute, ideal-measurements of one can disturb ideal-measurement 
of the other^ 

Since we have understood the reason why both statements are not si- 
multaneously true as a result of disturbance, we can now see the "sense" 
in which the definite ABL assignments can be simultaneously relevant. Our 
main argument is that if one doesn't perform absolutely precise (ideal) mea- 
surements but is willing to accept some finite accuracy, then one can bound 
the disturbance on the system. For example, according to Heisenberg's un- 
certainty relations, a precise measurement of position reduces the uncertainty 
in position to zero Ax = but produces an infinite uncertainty in momen- 
tum Ap = oo. On the other hand, if we measure the position only up to 
some finite precision Ax = A we can limit the disturbance of momentum 
to a finite amount Ap > %/A. By replacing precise measurements with a 
bounded-measurement paradigm, counter-factual thought experiments be- 
come experimentally accessible. What we often find is that the paradox 
remains - measurements produce surprising and often strange, but never- 
theless consistent structures. With limited-disturbance measurements, there 
is a sense in which both Pr(a x = +1) = 1 and Pr(a y = +1) = 1 are 
simultaneously relevant because measurement of one does not disturb the 
other. Since measurement of also can be understood as a simultaneous 
measurement of a x and a y , we will see that with limited-disturbance mea- 
surements, we can simultaneously use both a x — +1 and a y — +1 to obtain 

10 This is related to a violation of the product rule. In general, if is an eigenvector 
of A with eigenvalue a and \^2) is an eigenvector of B with eigenvalue b and [A, B] = 0, 
then if A and B are known only by either pre-selection or post-selection, then the product 
rule is valid, i.e. AB = ab. However if A and B are known by both pre-selection and 
post-selection, then the product rule is not valid, i.e. AB ^ ab, i.e. they can still disturb 
each other, even though they commute. [31] ) 
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2 TSQM has revealed new features and ef- 
fects: Weak-Measurements 

ABL considered the situation of measurements between two successive ideal- 
measurements where one transitions from a pre-selected state |^/in) to a post- 
selected state |Wfl n ). The state of the system at a time t e \t{ n , tfij, i.e. after 
t in when the state is \^m) and before t nn when the state is |^fi n ) is generally 
disturbed by an intermediate ideal- measurement. A subsequent theoretical 
development arising out of the ABL work was the introduction of the weak- 
value of an observable which can be probed by a new type of measurement 
called the weak-measurement [5]. The motivation behind these measure- 
ments is to explore the relationship between \^i n ) and |\l/ nn ) by reducing the 
disturbance on the system at the intermediate time. This is useful in many 
ways, e.g. if a weak-measurement of A is performed at the intermediate time 
t G [tin,tfin] then, in contrast to the ABL situation, the basic object in the 
entire interval tj n — > t nn for the purpose of calculating other weak-values for 
other measurements is the pair of states l^m) and |\l/nn)- 

2.1 Quantum Measurements 

Weak-measurements [5] originally grew out of the quantum measurement the- 
ory developed by von Neumann [42j"l First we consider ideal-measurements 
of observable A by using an interaction Hamiltonian H int of the form H int = 
— \(t)Q md A where Q m d is an observable of the measuring-device (e.g. the 
position of the pointer) and X(t) is a coupling constant which determines the 
duration and strength of the measurement. For an impulsive measurement 
we need the coupling to be strong and short and thus take A(t) ^ only for 
t E (to — e, t +e) and set A = //°X e X(t)dt. We may then neglect the time evo- 
lution given by H s and H md in the complete Hamiltonian H = H s + md -hHi nt . 
Using the Heisenberg equations-of-motion for the momentum P md of the 



^^Weak-measurements and their outcome, weak-values, can be derived in all approaches 
to quantum measurement theory. E.g. the usual projective measurement typically utilized 
in quantum experiments is a special case of these weak- measurements |50j . 
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measuring-device (conjugate to the position Q md ), we see that P m d evolves ac- 
cording to ^f - = X(t)A. Integrating this, we see that P m d(T)-P md (0) = XA, 
where P m d(0) characterizes the initial state of the measuring-device and 
P m d(T) characterizes the final. To make a more precise determination of A re- 
quires that the shift in P m d, i.e. <5P m d = P m d(P) — P md (0), be distinguishable 
from it's uncertainty, AP m d. This occurs, e.g., if P m d(0) and P m d(T) are more 
precisely defined and/or if A is sufficiently large (see figure [Ha). However, 
under these conditions (e.g. if the measuring-device approaches a delta func- 
tion in P m d), then the disturbance or back-reaction on the system is increased 
due to a larger H int , the result of the larger AQ md (AQ md > -x-p — ). When 

aid 

A is measured in this way, then any operator O ([A, O] 7^ 0) is disturbed 
because it evolved according to -^O = iX(t)[A,0}Q ra d, and since XAQ m d is 
not zero, O changes in an uncertain way proportional to AAQ m d@ 

In the Schroedinger picture, the time evolution operator for the complete 
system from t = t — e to t = t +e is exp{— i f t °^ H{t)dt] = exp{— iXQ m dA}. 
This shifts P m d (see figure EJa). If before the measurement the system was 
in a superposition of eigenstates of A, then the measuring-device will also be 
superposed proportional to the system. This leads to the "quantum measure- 
ment problems," discussed in (26] . A conventional solution to this problem 
is to argue that because the measuring-device is macroscopic, it cannot be 
in a superposition, and so it will "collapse" into one of these states and the 
system will collapse with it. 

2.1.1 Weakening the interaction between system and measuring 
device 

Following our intuition we now perform measurements which do not disturb 
either the pre-or-post-selections. The interaction H mt = — X(t)Q m dA is weak- 
ened by minimizing AAQ m d- For simplicity, we consider A 1 (assuming 
without lack of generality that the state of the measuring-device is a Gaussian 
with spreads AP md = AQ md = 1). We may then set e~ lXQuidA w 1 — i AQmd A 

12 E.g. in the spin-1/2 example, the conditions for an ideal-measurement SP^ ld = Xcr^ S> 
AP^ d will also necessitate AQf nd ^> ^4- which will thereby create a back-reaction causing 
a precession in the spin such that AO 1 (i.e. more than one revolution), thereby 
destroying (i.e. making completely uncertain) the information that in the past we had 
a x = +1, and in the future we will have a y = +1. 
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a) System 



Measuring Device 



t 

time t 
I 



b) System Measuring Device 



fin 



Mail 




^"liid 



'Pmd' ' 

5P md = Aai> AP md 
Ideal "strong" measurement 



fin 




|*fa) 5Pmd = AA w <AP md 



Weak- measurement 



Figure 5: a) with an ideal or "strong" measurement at t (characterized e.g. by 
SPmd = Azi ^ A^md), then ABL gives the probability to obtain a collapse onto 
eigenstate a\ by propagating (Q/fi n | backwards in time from ig n to t and ^\ r ) for- 
wards in time from t\ a to t; in addition, the collapse caused by ideal-measurement 
at t creates a new boundary condition |ai)(cti| at time t G [tinjifin]; b) if a weak- 
measurement is performed at t (characterized e.g. by 5P m d = XA W <C AP md ), then 
the outcome of the weak- measurement, the weak-value, can be calculated by prop- 
agating the state (^finl backwards in time from tfj n to t and the state I* in) forwards 
in time from i; n to t; the weak-measurement does not cause a collapse and thus no 
new boundary condition is created at time t. 



and use a theorem |49) 13 l : 

A\V) = (A)\V) + AA\* ± ), (2.8) 
to show that before the post-selection, the system state is: 

e -<AQ md i^. n) = (i_u4 nd i)|^ in > = (l- i\4nd(4»l*in) - Ag md Ai|^ in± ) 

(2-9) 

Using the norm of this state || (1 - i\Q md A)\^ in ) || = 1 + X 2 Q 2 md (A 2 }, the 
probability to leave \^- m ) un-changed after the measurement is: 



1 (A — > 0) 

while the probability to disturb the state (i.e. to obtain \^i n ±)) is: 



(2.10) 



i+A 2 e d w 



(A —> 0) 



(2.11) 



13 where (A) = |*) is any vector in Hilbert space, AA 2 = (^\(A - (A)) 2 \^), 

and 1*1 ) is a state such that (*|* i ) = 0. 
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The final state of the measuring-device is now a superposition of many 
substantially overlapping Gaussians with probability distribution given by 

Pr(Pmd) = Y,i | (ctf | \IZin) | 2 exp | — ( ' P 2Ap^ a '' > }• This sum is a Gaussian mixture, 

so it can be approximated by a single Gaussian l>^ n d (P md ) « (P md |e~* AQmd ^ |$™ d ) « 

exp |- (Pm ^~y ))2 | centered on A (A). 

2.1.2 Information gain without disturbance: safety in numbers 

It follows from eq. I2.lll th.at the probability for a collapse decreases as 0(A 2 ), 
but the measuring-device's shift grows linearly 0(A), so 5P m d = Aa.j. For a 
sufficiently weak interaction (e.g. A « 1), the probability for a collapse 
can be made arbitrarily small, while the measurement still yields informa- 
tion but becomes less precise because the shift in the measuring- device is 
much smaller than its uncertainty 5P m d <C A? m( j (figure 0b). If we per- 
form this measurement on a single particle, then two non-orthogonal states 
will be indistinguishable. If this were possible, it would violate unitarity 
because these states could time evolve into orthogonal states |$J" d ) — ► 
and Itfa)!**,) — > |* 2 )|*S d (2)), with |*i)|$Sd(l)) orthogo- 
nal to 1*2) |*md(2))- With weakened measurement interactions, this does 
not happen because the measurement of these two non-orthogonal states 
causes a smaller shift in the measuring-device than it's uncertainty. We con- 
clude that the shift <5P m d °f the measuring-device is a measurement error 
because lC(P md ) = (P md - A(i)fe) « (P md |$Sd) for A < 1. Neverthe- 
less, if a large (N > ensemble of particles is used, then the shift of all 
the measuring- devices {5P^\ ~ X(A)^- = N'(A)) becomes distinguishable 
because of repeated integrations, while the collapse probability still goes to 
zero. That is, for a large ensemble of particles which are all either |^ 2 ) or 
|\l/i), this measurement can distinguish between them even if |* 2 ) an d |*i) 
are not orthogona B 

Using these observations, we now emphasize that the average of any op- 
erator A, i.e. (A) = (*|A|*), can be obtained in three distinct cases [331185]: 

1. Statistical method with disturbance: the traditional approach 
is to perform ideal-measurements of A on each particle, obtaining a 
variety of different eigenvalues, and then manually calculate the usual 
statistical average to obtain (A). 

14 because the scalar product (^1 ) = cos " ^ — y 
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2. Statistical method without disturbance as demonstrated by us- 
ing = (A)\fy) + AA\ty±). We can also verify that there was no 
disturbance: consider the spin-1/2 example ( §1.2.ip . pre-selecting an 
ensemble, |t x ), then performing a weakened-measurement of and 
finally a post-selection again in the re-direction (figure [6]). For every 
post-selection, we will again find |f x ) with greater and greater cer- 
tainty (in the weakness limit), verifying our claim of no disturbance. 
Each measuring device is centered on |<jg| fx) = ^75 and the whole 
ensemble can be used to reduce the spread (figure [Tie). The weak- 
ened interaction for <jg means that the inhomogeneity in the magnetic 
field induces a shift in momentum which is less than the uncertainty 
SP^ d < AP^d an d thus a wave packet corresponding to Ux ^ v = 1 
will be broadly overlapping with the wave packet corresponding to 
ax ^ y = —I- A particular example is depicted in fig. [71a. (follow- 
ing |5T]) with <S>™ d (P md ) = (A 2 7r)- 1 /4 exp {_p2 d/2A 2 } and A = APmd 
now parametrizes the "weakness" of the interaction instead of A. In the 
ideal-measurement regime of A << 1, the probability distribution of 
the measuring-device is a sum of 2 distributions centered on eigenvalues 
±1, figure [7J a. 

p r (P md ) = cos 2 (7r/8)e- (Pmd - 1)2/A2 + sin 2 (7r/8) e - (Pmd+1)2/A2 (2.12) 

The weak regime occurs when A is larger than the separation between 
the eigenvalues of ±1 (i.e. A >> 1); e.g. [TJb. 

3. Non-statistical method without disturbance is the case where 

is the "eigenvalue" of a single "collective operator," A^ N ' = 
jf J2[L\ A\ (with Ai the same operator A acting on the i-th particle). Us- 
ing this, we are able to obtain information about without caus- 
ing disturbance (or a collapse) and without using a statistical approach 
because any product state I^W) becomes an eigenstate of the operator 
AW. To see this, we apply the theorem = + AA^^) 
to AWltfW), i.e.: 



N(A)\¥ N) ) + AA^T^f^)) 



(2.13) 



where (A) is the average for any one particle and the states \^^P(i)) are 
mutually orthogonal and are given by (*)) = l^)i|^)2---|^±)«---|^ / ) 
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That is, the ith state has particle i changed to an orthogonal state and 
all the other particles remain in the same state. If we further define a 
normalized state l^j^) = J2i ~m\^±(^)) then the last term of eq. 12.131 



is 



^|*f } ) and it's size is 



K If 



0. Therefore, \^ N> >) 



becomes an eigenstate of A^ N \ with the value (A) and not even a single 
particle has been disturbed (as N — > oo). 

In the last case, the average for a single particle becomes a robust property 
over the entire ensemble, so a single experiment is sufficient to determine the 
average with great precision. There is no longer any need to average over 
results obtained in multiple experiments. 



all I <7 T 
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weakejied measurement 
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-1) 



-1} 
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Figure 6: Obtaining the average for an ensemble. 



particle N 



Tradition has dictated that when measurement interactions are limited 
so there is no disturbance on the system, then no information can be gained. 
However, we have now shown that when considered as a limiting process, the 
disturbance goes to zero more quickly than the shift in the measuring-device, 
which means for a large enough ensemble, information (e.g. the expectation 
value) can be obtained even though not even a single particle is disturbed. 
This viewpoint thereby shifts the standard perspective on two fundamental 
postulates of QM@ 



15 This is also helpful to understand the quantum to classical transition because typical 
classical interactions involve these collective obscrvables which do not disturb each other. 
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Figure 7: "Spin component measurement without post-selection. Prob- 
ability distribution of the pointer variable for measurement of erg when the parti- 
cle is pre-selected in the state \] x )- (a) Strong measurement, A = 0.1. (6) Weak 
measurement, A = 10. (c) Weak-measurement on the ensemble of 5000 parti- 
cles. The original width of the peak, 10, is reduced to 10/\/5000 ~ 0.14. In the 
strong measurement (a) the pointer is localized around the eigenvalues ±1, while 
in the weak-measurements (6) and (c) the peak is located in the expectation value 
(TxNU = 1/V2." From [EE] 
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2.1.3 Adding a post-selection to the weakened interaction: Weak- 
Values and Weak-Measurements 

Having established a new measurement paradigm -information gain without 
disturbance- it is fruitful to inquire whether this type of measurement reveals 
new values or properties. With weak-measurements (which involve adding a 
post-selection to this ordinary -but weakened- von Neumann measurement), 
the measuring-device registers a new value, the weak-value. As an indication 
of this, we insert a complete set of states {|\&nn)j} into the outcome of the 
weak interaction of §2.1.11 (i.e. the expectation value {A)): 

(A) = (% 

(2.14) 

If we interpret the states |^/fi n )j as the outcomes of a final ideal-measurement 
on the system (i.e. a post-selection) then performing a weak-measurement 
(e.g. with AAQ md — > 0) during the intermediate time t G [t- m , tg n ], pro- 
vides the coefficients for K^Ajj 1 !/^)! 2 which gives the probabilities Pr(j) for 
obtaining a pre-selection of (^ in | and a post-selection of \^a n )j. The inter- 
mediate weak-measurement does not disturb these states and the quantity 
A w (j) = ^~[-^-y is the weak- value of A given a particular final post- 
selection (^fin \j. Thus, from the definition (A) = J2j P r (j) A w (j), one can 
think of (A) for the whole ensemble as being constructed out of sub-ensembles 
of pre-and-post-selected-states in which the weak-value is multiplied by a 
probability for a post-selected-state. 

The weak-value arises naturally from a weakened measurement with post- 
selection: taking A << 1, the final state of measuring-device in the momen- 
tum representation becomes: 

\ ^ fin I ™ in / 
« (^nn|^in)(P m d|e^ A<5A 1$r) 

-> (^ fin |^ in )exp{-(P md -AA w ) 2 } (2.15) 



where A 



<*nn 


A\v in ) 


(*fin| 





(*fin 


J 


A 


*in> 


(*fin 
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The final state of the measuring-device is almost un-entangled with the sys- 
tem; it is shifted by a very unusual quantity, the weak-value, A w , which is 
not in general an eigenvalue of We have used such limited disturbance 
measurements to explore many paradoxes (see, e.g. [HI [32]). A number 
of experiments have been performed to test the predictions made by weak- 
measurements and results have proven to be in very good agreement with 
theoretical predictions [Ml ESI ESI ESI E2] • Since eigenvalues or expectation 
values can be derived from weak- values [H] , we believe that the weak- value 
is indeed of fundamental importance in QM. In addition, the weak-value is 
the relevant quantity for all generalized weak interactions with an environ- 
ment, not just measurement interactions. The only requirement being that 
the 2-vectors, i.e. the pre-and-post-selection, are not significantly disturbed 
by the environment. 



2.2 Fundamentally new features of weak- values 
2.2.1 Weak- values and 3-b ox-paradox 

Returning to the 3-box-paradox ( §1.2.21) . we can calculate the weak- values of 
the number of particles in each box, e.g.: 

r iM M n (* fl n|A)(A|tt in ) ^{(A\HB\-(C\}\A) { A\^ { \A) + \B) + \C)} ^ 

<* fin |* fln > ^ {{ A\ + (Bh(C\}^{\A)+\B)+\C)} ~ ±(1+1-1) 

However, we can more easily ascertain the weak-values without calculation 
due to the following theorems: 

Theorem 1 : The sum of the weak- values is equal to the weak- value of the 
sum |64j : 

if (P A )w = (Pb + Pc)w then (P A ) W = (P B ) W + (P c ) w (2.16) 



Theorem 2 [65]: If a single ideal-measurement of an observable Pa is per- 
formed between the pre-and-post-selection, then if the outcome is definite 
(e.g. Prob(P a = 1) = 1) then the weak-value is equal to this eigenvalue (e.g. 
(Pa)w = 1) ®- 

This also provides a direct link to the counterfactual statements ( §1.2.31) 
because all counterfactual statements which claim that something occurs 



Thereby challenging another fundamental postulate of QM. 
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with certainty, and which can actually be experimentally verified by sep- 
arate ideal-measurements, continue to remain true when tested by weak- 
measurements. However, given that weak-measurements do not disturb each 
other, all these statements can be measured simultaneously. 

Applying Theorem 2 to the 3-box-paradox, we know the following weak- 
values with certainty: 

(P A ) W = 1, (Pb)w = 1, Ptotai = (Pa + Pb + Pc)w = 1. (2.17) 
Using theorem 1, we obtain: 

(frfin|Ptofq;-PA-PB|frin) 
(^finl^in) 

(P A + Pb + P C )w - (P A )w - (Pb)w = -L (2.18) 

This surprising theoretical prediction of TSQM has been verified experimen- 
tally using photons [39]. What interpretation should be given to (Pc)w = 
— 1? Any weak- measurement which is sensitive to the projection operator 
Pc will register the opposite effect from those cases in which the projection 
operator is positive, e.g. a weak-measurement of the amount of charge in 
box C in the intermediate time will yield a negative charge (assuming it is 
a positively charged particle). For numerous reasons, we believe the most 
natural interpretation is: there are —1 particles in box C . 

2.2.2 How the weak-value of a spin-1/2 can be 100 

The weak- value for the spin-1/2 considered in §1.2.11 (which was confirmed 
experimentally for an analogous observable, the polarization [54] ) is: 

,» . <t„ \^$t\ Tx) _ {(T y \o y ] + R| T,)} (t„ |i + i|T,) ft 

(2.19) 

Normally, the component of spin <7| is an eigenvalue, ±1, but the weak-value 
(<5"|)w = \/2 is \/2 times bigger, (i.e. lies outside the range of eigenvalues of 
a ■ n) 0. How do we obtain this? Instead of post-selecting a x — 1 (figure 

17 Weak-values even further outside the eigenvalue spectrum can be obtained by post- 
selecting states which are more anti-parallel to the pre-selection: e.g. if we post-select 
the +1 eigenstate of (cosa)^ + (sina)cr z , then (<7 2 ) w = A tan yielding arbitrarily large 
values such as spin- 100. 



(P C )w = 
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El), we post-select a y — 1 which will be satisfied in one-half the trials (figure 
Hi! 



either 
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or | (T y = — 1) 
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Figure 8: Statistical weak-measurement ensemble. 



o-« = +1, 



particle N 



To show this in an actual calculation, we use eq. 12.161 and the post- 
selected state of the quantum system in the basis (| \ y ) = cos(7r/8)| 
) — sin(7r/8)| the measuring-device probability distribution is: 



Pr{P, 



mdj 



N 2 



cos 



(7r/8)e 



-(Pmd-1) 2 /A 2 



Sill 



(7r/8)e 



-(P md +1) 2 /A 2 l2 



(2.20) 



With a strong or ideal- measurement, A < 1, the distribution is localized 
again around the eigenvalues ±1, as illustrated in figures [9ja andEjJb, similar 
to what occured in figure [7Ja. What is different, however, is that when 
the measurement is weakened, i.e. A is made larger, then the distribution 
changes to one single distribution centered around a/2, the weak-value, as 
illustrated in figures EJc-f, (the width again is reduced with an ensemble 
IHJf). Using eq. 12.144 we can see that the weak- value is just the pre-and-post- 
selected sub-ensemble arising from within the pre-selected-only ensembles. 
That is, [9jf is a sub-ensemble from the full ensemble represented by the 
expectation value, figure [71c. 

The non-statistical aspect mentioned in case-3 ( §2.1. 21) can also be ex- 
plored by changing the problem slightly. Instead of considering an en- 
semble of spin-1/2 particles, we now consider "particles" which are com- 
posed of many (N) spin-1/2 particles, and perform a weak-measurement of 



18 If a post-selection does not satisfy & y = +1, then that member of the sub-ensemble 
must be discarded. This highlights a fundamental difference between pre-and-post- 
selection due to the macrosopic arrow-of-time: in contrast to post-selection, if the pre- 
selection does not satisfy the criteria, then a subsequent unitary transformation can trans- 
form to the proper criteria. 
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Figure 9: "Measurement on pre-and-post-selected ensemble. Probability 
distribution of the pointer variable for measurement of erg when the particle is pre- 
selected in the state \] x ) and post-selected in the state \] y )- The strength of the 
measurement is parameterized by the width of the distribution A. (a) A = 0.1; 
(6) A = 0.25; (c) A = 1; (d) A = 3; (e) A = 10. (/) Weak-measurement on 
the ensemble of 5000 particles; the original width of the peak, A = 10, is reduced 
to 10/\/5000 ~ 0.14. In the strong measurements (a)-(b) the pointer is localized 
around the eigenvalues ±1, while in the weak-measurements (d)-(f) the peak of 
the distribution is located in the weak-value (a^) w = (Ty| £7 $lTa;)/(Tj/|Tx) = v2- The 
outcomes of the weak-measurement on the ensemble of 5000 pre-and-post-selected 
particles, (/), are clearly outside the range of the eigenvalues, (-1,1)." From [51] 
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the collective observable er^ = Dili &\ in the 45°-angle to the x — y 
plane. Using H int = — ^p-Qmd Xali a particular pre-selection of |f a .) 
(i.e. = njlx IT.),) and post-selection (i.e. (^| = n£=i<Ty|fc = 

n^Li {(T«|n + z(jz|n})) the final state of the measuring-device is: 

N ( \ N \ N 

« D > = n<uex P U4ndE^ n ir«>*io (2.2i) 

3=1 l iV k=l ) i=l 

Since the spins do not interact with each other, we can calculate one of the 
products and take the result to the Nth power: 

N f \ i r r a i ^ N 



«°> = n<a-exp|-Q md ^j = |(T,|ex P |-g md a e | |T.)J |<0 

(2.22) 

Using the following identity exp {iaa^} = cosa + ia^sina [57], this becomes: 



cos 20g sin 



md 



N 5 JV 



JV 



= [(T,IU] 7V |cos^-za.sin^| |$D (2.23) 

where we have substituted a w = (^) 1U = . We consider only the 

second part (the first bracket, a number, can be neglected since it does not 
depend on Q and thus can only affect the normalization): 

|C D ) = (l - - iXaw ® md \ l^we^lO (2-24) 



When_j projected onto P m d, i.e. the pointer, we see that the pointer is 
robustly shifted by the the same weak-value obtained with the previous sta- 
tistical method, i.e. y/2: 



19 The last approximation was obtained as N — > oo, using (1 + j^) N = (1 + ^) ™ a « e a . 



26 



A single experiment is now sufficient to determine the weak- value with great 
precision and there is no longer any need to average over results obtained 
in multiple experiments as we did in the previous section. Therefore, if we 
repeat the experiment with different measuring-devices, then each measuring- 
device will show the very same weak-values, up to an insignificant spread 
of ^= and the information from both boundary conditions, i.e. |^in) = 

Ilili \Vx)i an d (^finl = riiIi(Ti/|j; describes the entire interval of time between 
pre-and-post-selection. Following j5fj, we consider an example with N = 20. 
The probability distribution of the measuring-device after the post-selection 
is: 

N 

F^Q2)=A/- 2 (£(-l)W(^ 
i=i 

(2.26) 

and is drawn for different values of A in figure [TUl While this result is rare, we 
have recently shown [33] how any ensemble can yield robust weak- values like 
this in a way that is not rare and for a much stronger regime of interaction. 
We have thereby shown that weak-values are a general property of every 
pre-and-post-selected ensemble q 

2.2.3 Hardy's Paradox 

Another surprising pre-and-post-selection effect is Hardy's gedanken-experiment 
which is a variation of interaction- free measurements (IFM) [58], consisting of 
two "superposed" Mach-Zehnder interferometers (MZI) (figure [TTj) . one with 
a positron and one with an electron. Consider first a single interferome- 
ter, for instance that of the positron (labeled by +). By adjusting the arm 
lengths, it is possible to arrange specific relative phases in the propagation 
amplitudes for paths between the beam-splitters BS1 + and BS2 + so that 
the positron can only emerge towards the detector C + . However, the phase 
difference can be altered by the presence of an object, for instance in the 
lower arm, in which case detector D + may be triggered. In the usual IFM 
setup, this is illustrated by the dramatic example of a sensitive bomb that 
absorbs the particle with unit probability and subsequently explodes. In this 
way, if D + is triggered, it is then possible to infer the presence of the bomb 

20 We have also proposed this as another innovative new laser-technology, e.g. in the 
amplification of small non-random signals by minimizing uncertainties in determining the 
weak value and by minimizing sample size. [33] 
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Figure 10: "Measurement on a single system. Probability distribution of 
the pointer variable for the measurement of A = (X)i=i(°"i)^)/20 when the system 
of 20 spin-i particles is pre-selected in the state = Yll=i \1x)i and post- 

selected in the state = Y\i=i lty)«- While in the very strong measurements, 
A = 0.01 — 0.05, the peaks of the distribution located at the eigenvalues, starting 
from A = 0.25 there is essentially a single peak at the location of the weak-value, 
A w = y/2." From [H] 
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without "touching" it, i.e., to know both that there was a bomb and that 
the particle went through the path where there was no bomb. 




Figure 11: a) counterfactual r 



disturbs the Electron 




VNO + 



e electron 



could end up in the D detector even if no positron were present in the overlapping 
arm, b) electron must be on the overlapping path Nq = 1 
be on overlapping path Nq = 1 



c) positrcm also must 



In the double-MZI, things are arranged so that if each MZI is considered 
separately, the electron can only be detected at C~ and the positron only at 
C + . However, because there is now a region where the two particles overlap, 
there is also the possibility that they will annihilate each other. We assume 
that this occurs with unit probability if both particles happen to be in this 
region. 

According to QM, the presence of this interference-destroying alternative 
allows for a situation similar to IFM in which detectors D~ and D + may 
click in coincidence (in which case, obviously, there is no annihilation). 

Suppose D~ and D + do click. Trying to "intuitively" understand this 
situation leads to paradox. For example, we should infer from the clicking of 
D~ that the positron must have gone through the overlapping arm; otherwise 
nothing would have disturbed the electron, and the electron couldn't have 
ended in D~ . Conversely, the same logic can be applied starting from the 
clicking of D + , in which case we deduce that the electron must have also 
gone through the overlapping arm. But then they should have annihilated, 
and couldn't have reached the detectors. Hence the paradox. 

These statements, however, are counter-factual, i.e. we haven't actually 
measured the positions. Suppose we actually measured the position of the 



29 



electron by inserting a detector T>q in the overlapping arm of the electron- 
MZI. Indeed, the electron is always in the overlapping arm. But, we can no 
longer infer from a click at D~ that a positron should have traveled through 
the overlapping arm of the positron MZI in order to disturb the electron 
(figure [TTla). The paradox disappears. 

As we mentioned ( §1.2.3p . weak- measurements produce only limited dis- 
turbance and therefore can be performed simultaneously, allowing us to ex- 
perimentally test such counter-factual statements. Therefore we would like 
to test [HI [32] questions such as "Which way does the electron go?" , "Which 
way does the positron go?" , "Which way does the positron go when the elec- 
tron goes through the overlapping arm?" etc. In other words, we would like 
to measure the single-particle "occupation" operators 

N£o = |NO) p (NO| p iV+ = |0) p (0| p 

N^o = |NO) c (NO| c No = |0) c (0| c (2.27) 

which tell us separately about the electron and the positron. We note a most 
important fact, which is essential in what follows: the weak- value of a product 
of observables is not equal to the product of their weak-values. Hence, we 
have to measure the single-particle occupation-numbers independently from 
the pair occupation-operators: 

= ^NO^O ^O.NO = ^O^NO 

^oio = ^no"no = N+oN^o (2.28) 

These tell us about the simultaneous locations of the electron and positron. 
The results of all our weak-measurements on the above quantitites, echo, to 
some extent, the counter-factual statements, but go far beyond that. They 
are now true observational statements (and experiments have successfully 
verified these results [SS])- In addition, weak- values obey an intuitive logic of 
their own which allows us to deduce them directly. While this full-intuition 
is left to published articles [HI [32], we discuss the essence of the paradox 
which is defined by three counterfactual statements: 

• The electron is always in the overlapping arm. 

• The positron is always in the overlapping arm. 

• The electron and the positron are never both in the overlapping arms. 
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To these counterfactual statements correspond the following observational 
facts. In the cases when the electron and positron end up at D~ and D + 
respectively and if we perform a single ideal-measurement of: 

• Nq , we always find Nq = 1 (figure [TTlb). 

• Nq, we always find Nq = 1 (figure ITTlc). 

• Nq'q, we always find Nq'q = (figure fT2la). 

The above statements seem paradoxical but, of course, they are valid only 
if we perform the measurements separately; they do not hold if the mea- 
surements are made simultaneously However, Theorem 2 says that when 
measured weakly all these results remain true simultaneously: 

Nq w = 1, Nq- w = 1 (2.29) 

Using theorems 1 and 2, all other weak- values can be trivially deduced: 

Nno w = 0, N+ Qw = (2.30) 

N^; 0w = (2.31) 

No$o» = 1» N Po,o w = 1 (2-32) 

Nno,now = ~ 1- (2.33) 
What do all these results tell us? 

First of all, the single-particle occupation numbers (I2.29P are consistent 
with the intuitive statements that "the positron must have been in the over- 
lapping arm otherwise the electron couldn't have ended at D~" (figure [TTlc) 
and also that "the electron must have been in the overlapping arm other- 
wise the positron couldn't have ended at D +n (figure [TTJb). But then what 
happened to the fact that they could not be both in the overlapping arms 
since this will lead to annihilation? QM is consistent with this too - the pair 
occupation number ^ q w = shows that there are zero electron-positron 
pairs in the overlapping arms (figure [121 a)! 

We also feel intuitively that "the positron must have been in the overlap- 
ping arm otherwise the electron couldn't have ended at D~ , and furthermore, 
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the electron must have gone through the non-overlapping arm since there was 
no annihilation" (figure fT2lb). This is confirmed by Nq'^ q = 1. But we also 
have the statement "the electron must have been in the overlapping arm oth- 
erwise the positron couldn't have ended at D~ and furthermore the positron 
must have gone through the non-overlapping arm since there was no annihila- 
tion". This is confirmed too, N^fo 0w = 1. But these two statements together 
are at odds with the fact that there is in fact just one electron-positron pair 
in the interferometer. QM solves the paradox in a remarkable way - it tells 
us that A ? i |q _ NOw = —1, i.e. that there is also minus one electron-positron 
pair in the non-overlapping arms which brings the total down to a single pair 
(figure Oc)! 




2.3 Contextuality 

TSQM and WMs have proven very useful in exploring many un-settled as- 
pects of QM. For example, using TSQM, we have shown that it is possible 
to assign definite values to observables in a new way in situations involving 
"contextuality." Traditionally, contextuality was thought to be a requirement 
for certain hypothetical modifications of QM. However, using pre-and-post- 
selection and weak-measurements, we have shown that QM implies contex- 
tuality directly [691 EOl E2]. 

What is contextuality? Bell-Kochen-Specker (BKS) proved that one can- 
not assign unique answers (i.e. a Hidden- Variable-Theory, HVT) to yes-no 
questions in such a way that one can think that measurement simply reveals 
the answer as a pre-existing property that was intrinsic solely to the quantum 
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system itself. BKS assumed that the specification of the HVT, i.e. V?(A), 

should satisfy: V^(F{A}) = F{Vj(A)}, i.e. any functional relation of an 
operator that is a member of a commuting subset of observables must also 
be satisfied if one substitutes the values for the observables into the func- 
tional relations. A consequence of this is satisfaction of the sum and product 
rules and therefore BKS showed that with any system (of dimension greater 
than 2) the 2 n possible "yes-no" assignments (to the n projection operators 
representing the yes-no questions) cannot be compatible with the sum and 
product rules for all orthogonal resolutions of the identity. Thus, a HVT-the 
hypothetical modification of QM-must be contextual. 

In [32] it was first pointed out and extensively discussed and later proven [28] 
that whenever there is a logical pre-and-post-selection-paradox (as in the 3- 
box-paradox § 1.2.2j) . then there is a related proof of contextuality. However, 
the elements in the proof are all counter-factual. TSQM has taught us that 
by applying theorems 1 and 2, we can for the first time obtain an experimental 
meaning to the proof. 

By way of example, we consider Mermin's version of BKS with a set of 9 
observables. It is intuitive |30j to represent all the "functional relationships 
between mutually commuting subsets of the observables," i.e. Vr(F{A}) = 

F{V^(A)}, by drawing them in fig. 13 and arranging them so that all the 
observables in each row (and column) commute with all the other observables 
in the same row (or column). 
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Figure 13: 4-D BKS example 

V$(F{A}) = F{V^(A)} requires that the value assigned to the product of 
all three observables in any row or column must obey the same identities that 
the observables themselves satisfy, i.e. the product of the values assigned to 
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the observables in each oval yields a result of +1 except in the last column 
which gives -10 

E.g. computing column 3 of fig. 13: 



r«l«2l f"!"'! r"l»'l "1 -2-1 -2-1-2 -1-1-2-2-1-2 

\ a x a x)\ a y a y)\ (J Z (r Z ) = a x °x U y r y° ' z° ~z = Vj^/j ' z° ~z 

- ib\ib\b 2 z bl = -\ (2.34) 



commute so' 




commute so 



Computing the product of the observables in the third row, i.e.: 

r - 1 -2i r -2 -l-i r - 1 -2\ - 1 - 2 - 2 - 1 r - 1 - 2\ -1-1/ r - 1 - 2\ 

2<x 2 =i(j 1 commute so^ 

= ^{-^ 2 }R 2 } = +1> (2-35) 



If the product rule is applied to the value assignments made in the rows, 
then: 

V * ' " v ' " v ' 

row 1 row 2 row 3 

(2.36) 

while the column identities require: 

V&i)Vffi)Vffiaft = Vj(al)Vj(al)Vj(a 2 x al) = +1 

V v ' " * ' 

column 1 column 2 

V^lal)V^ay y )V^ala 2 z ) = -1 (2.37) 

S v ' 

column 3 

However, it is easy to see that the 9 numbers cannot satisfy all 6 con- 
straints because multiplying all 9 observables together gives 2 different re- 
sults, a +1 when it is done row by row and a —1 when it is done column by 

21 The value assignments are given by Vt(&].) = (a^ ® I 2 } , V?{a%) = (I 1 (g) a 2 )... 
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column: 



row 1 



row 2 



row 3 



(2.38) 



column 1 



ip\ w yJ r i> 

column 2 



column 3 



(2.39) 

There obviously is no consistent solution to eqs. 12.391 and 12.381 since they 
contain the same set of numbers, simply ordered differently. Therefore the 
values assigned to the observables cannot obey the same identities that the 
observables themselves obey, V^(F{A}) ^ F{V^(A)}, and an HVT would 
have to assign values to observables in a way that depended on the choice of 
which of 2 mutually commuting sets of observables that were also chosen to 
measure, i.e. the values assigned are contextual. For example, the assignment 
o\a\ = ±1 depends on whether we associate o\o 2 z with row-3 or with column- 
3. 

We briefly summarize application of TSQM to the Mermin example[69i 
[70| [32] . A single pre-and-post-selection (figure [HJ) allows us to assign a 
definite value to any single observable in figure 13. That by itself is new and 
surprising. 

(a) (b) 
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w = 1 i 




0-1 = 1 = lj 
OyOl = 1 



ffi = 1, a 

•~ 1 "'2 
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aia^ = 1 



\°l = 1) \*l = 1) 



cr 



I = 1) l*J = 1) 



Figure 14: pre-and-post-selection states for Mermin example. 



Moreover, for "contextuality," we must determine how many of the prod- 
ucts of the 9 observables in figure 13 can be ascertained together with cer- 
tainty. In order to ascertain the products of any 2 pairs, the generalized 
state is required, an outcome that Mermin describes as "intriguing" [29J. 



35 



The generalized state is defined by[6]: \17 = 1 1 LJ The outcome 



t 



+ 



correlated correlated 



.+ 



I$l> 



1*2) 



l$3> 



l$n> 



Figure 15: Generalized State: superpositions of 2-vectors. 



for the product of the first two observables in column 3 of figure 13 with the 
pre-and-post-selection of fig. dUa is a\a 2 x a\ j a'^ = +1. However, if we measure 
the operators corresponding to the first 2 observables of row 3 in figure 13, 
i.e. fr x fryfr x fry, given this particular pre-and-post-selection shown in fig. [THa. 
then the sequence of measurements interfere with each other (as represented 
by the slanted ovals in figure IT71a). To see this, consider that fr\fr\fr\fr\ cor- 
responds to the sequence of measurements represented in figure [TBI a. While 
the pre-selection of particle 2 is fr 2 x = 1 at t in , the next measurement after 
the pre-selection at is for fry and only after that a measurement of fr x 
is performed at £3. Thus, there is no guarantee that the fr x measurement 
at t 3 will give the same value as the pre-selected state of fr 2 — 1 or that 
the fry measurement will give the same value as the post-selected state of 
fry = 1. In TSQM, this is due to the disturbance of the 2-vector boundary 
conditions which is created by the ideal-measurement ( §2.11) : the initial pre- 
selected vector fr x = l from t in is "destroyed" when the fry measurement at 
time t 2 is performed and therefore cannot inform the later fr x measurement 
at time £3. In other words, with the particular pre-and-post-selection given 
in fig. [THa and [16la, the operator, fr\aifr 2 x fr 1 depends on information from 



22 This correlated state can be created by preparing at t; n a correlated state J^i <*t|^i)K) 
with \i) an orthonormal set of states of an ancilla. Then the ancilla is "guarded" so there 
are no interactions with the ancilla during the time (U n ,tfi„). At tfi n we post select on 
the particle and ancilla the state -7= J2i |^») I*)- If wc are successful in obtaining this state 
for the post-selection, then the state of 2 particles is described in the intermediate time by 
the entangled state (see figure 115]) . This is yet another example of a useful generalization 
of QM. 
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both the pre-selected vector a\ — 1, a 2 — 1 and the post-selected vector 
&y — 1, &y — 1 in a "diagonal-pre-and-post-selection" sense. We call this 
diagonal-pre-and-post-selection because a line connecting <5"^(ti) with cr 2 (t 3 ) 
will be diagonal or will cross the line connecting cr^^) with a^t^), where 
t in < t\ < t<2... < tfl n , see fig. ITTla). However, o\a 2 z is assigned differ- 
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Figure 16: Time sequence of pre-and-post-selection measurements for Mermin 
example. 




Figure 17: a) Measurement of &l.cFya 2 cr y is diagonal, b) measurement of a^a^ayCFy 
is diagonal 

ent values in different pre-and-post-selections. It is precisely because of this 
connection between particular pre-and-post-selections and different values 
for o\b\ that the issue of contextuality arises when we consider products of 
these observables. In other words, the contextuality here is manifested by the 
fact that cr\b y \<j 2 x o\ j = — 1 (given the pre-and-post-selection of fig. [16la) even 
though separately cr\o 2 = +1 and o^a^ = +1. But these 3 outcomes can be 
measured weakly without contradiction because the product of weak-values 
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is not equal to the weak-value of the product. Therefore, instead of contex- 
tuality being an aspect of a hypothetical replacement for QM (the HVT), we 
have shown that contextuality is directly part of QM [691 EH [32] . 

2.4 Nonlocality 

Traditionally, it was believed that "contextuality" was very closely related 
to "kinematic-nonlocality." Typically, kinematic-nonlocality refers to correla- 
tions, such as eq. II. lj , that violate Bell's-inequality with the consequence that 
QM cannot be replaced with a local realistic model. Similarly, contextuality 
refers to the impossibility of replacing QM with a noncontextual realistic the- 
ory. Applying this now to the relativistic-paradox ( §l.lj) . we see that Lorentz 
covariance in the state-description can be preserved in TSQM [9] because the 
post-selected vector af = +1 propagates all the way back to the initial prepa- 
ration of an EPR state, eq. EU \^ EPR ) = j- {|T)a |I)b- |I)a |T)b}- E.g. if 
A changes his mind and measures instead of or if we consider a differ- 
ent frame-of-reference, then this would change the post-selected vector all the 
way back to \^epb)- More explicitly, suppose the final post-selected-state 

is (v fin \ = j= 2 (u \ A {(i \ B + (t |b} = ^ {(T, U(T \b + (h Mi \b}- The 

full state-description is the bra-ket combination (which is not just a scalar 
product): 

(y f in\\y E pR) = ^ {(T MU +(T Mi \b} ^ {IT)a \l) B - \1)a ItM 

(2.40) 

There is no longer a need to specify a moment when a non-local collapse 
occurs, thereby removing the relativistic paradox. 

Finally, TSQM and weak-measurements also provide insight into a very 
different kind of non-locality, namely dynamical-nonlocality, e.g. that of the 
Aharonov-Bohm (AB) effect p]. We have shown how this novel kind of 
nonlocality can be measured with weak- measurements |71j . 
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3 TSQM lead to new mathematics, simplifi- 
cations in calculations, and stimulated dis- 
coveries in other fields 



TSQM has influenced work in many areas of physics, e.g. in cosmology 
IHTj , in black- holes [2TI |4*4"] , in superluminal tunneling [19], [38] , in quantum 
information [8], [821 [83], etc. We review two examples here. 

3.1 Super-oscillations 

Superoscillations [60] are functions which oscillate with an arbitrarily high 
frequency a, but which, surprisingly, can be understood as superpositions of 
low frequencies, \k\ < 1, seemingly a violation of the Fourier theorem: 

J2 c k e ikx -> e iax (3.41) 
|fc|<i 

Superoscillations were originally discovered through the study of weak- values. 
By way of example, consider again eq. 12.231 

{ N 

iX <Smd iX Qmd iX Qmd iX Q m d \ N 

+ a w > |$ in ) 

' e ^d (l + a w ) + e _^d (l-a w ) ) N |$ m D) (3 42) 



We already saw how this could be approximated as e lXaw ® md |$^ D ) which pro- 
duced a robust-shift in the measuring-device by the weak- value a/2. However, 

1 • // \ f ihRmd (l+ a ,„) iX Qmd fl— q„,1 1 ^ ■ n . rr 

we can also view 7/>(sc) = < e jv 2 ' + e at a — > ma very different 
way, by performing a binomial expansion: 



J,( T \ _ yW (l+a m )"(l-^) JV -" N\ fmA4nd 1 ™ n 



-»AQ md (Af-n) 
N 
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= Elo Cn exp { ^ ld <>-A0 j = ^ ^ exp l^^l (3 43) 

We see that this wavefunction is a superposition of waves with small wavenum- 
bers \k\ < 1 (because — 1 < &^D. < \y p or a small region (which can 
include several wavelengths 2ir/a w , depending on how large one chooses N), 
i/j(x) appears to have a very large momentum, since a w can be arbitrarily 
large, i.e. a super-oscillation. Because these regions of superoscillations are 
created at the expense of having the function grow exponentially in other 
regions, it would be natural to conclude that the superoscillations would 
be quickly "over-taken" by tails coming from the exponential regions and 
would thus be short-lived. However, it has been shown that superoscillations 
are remarkably robust [61] and can last for a surprisingly long time. This 
has therefore led to proposed/practical applications of superoscillations to 
situations which were previously probed by evanescent waves (e.g. in the 
superresolution of very fine features with lasers). 

As we mentioned in the introduction, TSQM is a re-formulation of QM, 
and therefore it must be possible to view the novel effects from the traditional 
single-vector perspective. This is precisely what super-oscillations teach us. 
In summary, there are 2 ways to understand weak-values: 

• the measuring- device is registering the weak-value as a property of the 
system as characterized by TSQM 

• the weak- value is a result of a complex interference effect in the measuring- 
device; the system continues to be described with a single- vector pur- 
suant to QM 

Oftentimes, calculations are either much simplified or can only be performed 
by utilizing the first approach (e.g. when the measuring-device is classi- 
cal) [32]. 

3.2 Quantum Random Walk 

Another fundamental discovery arising out of TSQM is the Quantum- Random- 
Walk [8] which has also stimulated discoveries in other areas of physics (for a 

23 In [SSI [HI] j we uncover several new relationships between the physical creation of the 
high-momenta associated with the superoscillations, eccentric weak-values, and modular 
variables which have been used to model the dynamical non- locality discussed in §2.41 62, 

Enrrj]. 
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Figure 18: "Demonstration of an approximate equality given by J2n=o c nf(t — 
a n ) ~ f(t — a). The sum of a function shifted by the 14 values cn between and 1 
and multiplied by the coefficients, yields approximately the same function shifted 
by the value 10. The dotted line shows f(t); the dashed line shows f(t— 10); and 
the solid line shows the sum." From [5T] 



11 



review, see [67]). In the second bullet above, the measuring-device is shifted 
by the operator with it's N + 1 eigenvalues equally spaced between — 1 
and +1 [32]. How can a superposition of small shifts between —1 and 1 give 
a shift that is arbitrarily far outside ±1? The answer is that states of the 
measuring- device interfere constructively for P^2 = a w and destructively 
for all other values of P^J such that $^f(P) - A w ), the essence 

of quantum-random- walk [8]. If the coefficients for a step to the left or right 
were probabilities, as would be the case in a classical random walk, then 
iV steps of step size 1 could generate an average displacement of y/N, but 
never a distance larger than N. However when the steps are superposed 
with probability amplitudes, as with the quantum-random-walk, and when 
one considers probability amplitudes that are determined by pre-and-post- 
selection, then the random walk can produce any displacement. In other 
words, instead of saying that a "quantum step" is made up of probabilities, 
we say that a quantum step is a superposition of the amplitude for a step 
"to the left" and the amplitude for a "step to the right," then one can super- 
pose small Fourier components and obtain a large shift. This phenomenon 
is very general: if f(t — a n ) is a function shifted by small numbers a n , then 
a superposition can produce the same function but shifted by a value a well 
outside the range of a n : E n=0 °nf(t — a n ) ~ f(t — a). The same values of a n 
and c n are appropriate for a wide class of functions and this relation can be 
made arbitrarily precise by increasing the number of terms in the sum, see 
figure [HI The key to this phenomenon is the extremely rapid oscillations in 
the coefficients c n = ( 1 +«^) n a-^) — 2_^VL_ in J2^ =0 c n exp {i\Q md k n }- 

4 TSQM suggests generalizations of QM 

4.1 Reformulation of Dynamics: each moment a new 
universe 

We review a generalization of QM suggested by TSQM [3H [32] which ad- 
dresses the "artificial" separation in all areas of theoretical physics between 
the kinematic and dynamical descriptions. David Gross has predicted [23] 
that this distinction will be blurred as the understanding of space and time 
is advanced and indeed we have developed a new way in which these tradi- 
tionally distinct constructions can be united. We note [3U [32] that the de- 
scription of the time evolution given by QM does not appropriately represent 
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multi-time-correlations which are similar to Einstein-Podolsky-Rosen/Bohm 
entanglement (eq. II. ID but instead of being between two particles in space, 
they are correlations for a single particle between two different times. Multi- 
time-correlations, however, can be represented by using TSQM. As a con- 
sequence, the general notion of time in QM is changed from the current 
conceptual framework which was inherited from CM, i.e.: 

1) : the universe is viewed as unique, and the objects which in- 
habit it just change their state in time. In this view, time is 
"empty," it just propagates a state forward; the operators of the 
theory create the time evolution; 

to a new conceptual framework in which: 

2) : each instant corresponds to a new pair of Hilbert spaces, 
(i.e., each instant is a new degree of freedom; in a sense, a new 
universe); instead of the operators creating the time evolution as 
in the previous approach, an entangled state (in time) "creates" 
the propagation: a whole new set of structures within time is able 
to "propagate" a quantum state forward in time, 

This new approach has a number of useful qualities, e.g.: 1) the dynamics 
and kinematics can both be represented simultaneously in the same language, 
a single entangled vector (in many Hilbert spaces), and 2) a new, more fun- 
damental complementarity between dynamics and kinematics is naturally 
introduced. This approach also leads to a new solution to the measurement 
problem which we model by uncertain Hamiltonians. Finally, these consid- 
erations are also relevant to the problem of the "Now," which was succinctly 
expressed by Davies [75] as "why is it 'now' nowT 1 The kinematic-dynamics 
generalization [3U [32] suggests a new fourth approach to time besides the 
traditional "block universe," "presentism," and "possibilism" models. 

While we leave all details to our other publications [3U [32]), in brief, 
consider a spin-1/2 particle, initially polarized "up" along the z axis, and 
having the Hamiltonian H = 0. In this case the time evolution of the particle 
is trivial, 

\V(t)) = constant = \a z = 1). (4.44) 

To see the deficiency in representing multi-time-correlations, we will consider 
an isomorphism between the correlations for a single particle at multiple 
instants of time and the correlations between multiple particles at a single 
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instant of time. Therefore, we ask if we could prepare N spin-1/2 particles 
such that if we perform measurements on them at some time to we would 
obtain the same information as we would obtain by measuring the state of the 
original particle at N different time moments, t±, t2---£/v? Since the state of 
the original particle at all these moments is \a z = 1), one would suppose that 
this task can be accomplished by preparing the N particles each polarized 
"up" along the z axis, that is eq. 14.451 (see also fig. [T9"|) : 

Wz = = 1>2-K = 1)jv (4.45) 



Wz = l)l Wz = 1)2 Wz = 1)3 Wz = 1>JV 

I I • • • • 

t x 1 1 1 — J— 

particle 1 particle 2 particle 3 particle N 

Figure 19: N spin-1/2 particles all in the initial or pre-selected state of a z = +1. 

But this mapping is not appropriate for many reasons. One reason is 
that the time evolution (I4.44p contains subtle correlations (i.e. multi-time- 
correlations), which usually are not noticed, and which do not appear in 
the state 14.451 but which can actually be measured. It is generally believed 
that since the particle is at every moment in a definite state of the z-spin 
component, the z-spin component is the only thing we know with certainty 
about the particle - all other spin components do not commute with o z and 
cannot thus be well-defined. However, there are multi-time variables whose 
values are known with certainty, given the evolution (14.441) . For example, 
although the x spin component is not well defined when the spin is in the 
\o z = 1) state, we know that it is constant in time, since the Hamiltonian 
is zero. Thus, for example, the two-time observable cr x (t 4 ) — a x {t 2 ) = is 
definite (t 2 < £4). However, there is no state of N spins such that 

# = = . = ( 4 - 46 ) 

for every direction n as would be required for all the multi-time-correlations. 
At best, one may find a two-particle state eq. 11.11 for which the spins are 
anti- correlated instead of correlated i.e. o\ = —o\. However, e.g., for 3 
particles, only 2 of them can be completely anti-correlated, thus it cannot be 
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extended to N particles. 




h |$>i 



Figure 20: particle 1 is correlated to the pre-selected state of particle 2. 

Although a state of A" spin 1/2 particles with complete correlations 
among all their spin components as required by eq. (14.461) doesn't exist 
in the usual sense, there are pre-and-post-selected states with this prop- 
erty given by TSQM. By way of example (see figure 1201) . the post-selected 
state of particle 1 can be completely correlated with the pre-selected state 
of particle 2 as described by the state $ = ^ {(! |i| t)2 _ (t|i| 1)2}- We 
are now able to preserve the single particle's multi-time-correlations by sim- 
ply "stacking" the A" spin-1/2 particles "one on top of the other" along the 
time axis (fig. I2~T1) . As a result of the correlations between the pre-and- 
post-selected states, a verification measurement of ar x {t^) — a x (t 2 ) (see left 
part of fig. |2T]) . will yield 0, i.e. perfect multi-time correlations because 
& x (t2, particle 2) — a x (t 2 , particle 1) = (see right part of fig. |2~IT) . When 
"stacked" onto the time axis, these correlations act like the identity operator 
and thus evolve the state forward, handing-off or effectively propagating a 
state from one moment to the next (although nothing is "really" propagating 
in this picture). 

4.2 Destiny states: new solution to measurement prob- 
lem 

Up until now we have limited ourselves to the possibility of 2 boundary con- 
ditions which obtain their assignment due to selections made before and after 
a measurement. It is feasible and even suggestive to consider an extension 
of QM to include both a wavefunction arriving from the past and a second 
"destiny" wavefunction coming from the future which are determined by 2 
boundary conditions, rather than a measurement and selection. This pro- 
posal could solve the issue of the "collapse" of the wavefunction in a new and 
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stack 



time 



a x (t 4 ) t 4 




1 single particle 



particle""!' pafti'cTe" 2 'pafticTe"3 



Figure 21: Measuring a x (t4) — & x {t2) for the single spin- 1/2 particle on the left en- 
sures perfect multi-time-correlations because a x (t2, particle 2)— a x (t2, particle 1) = 
0. 



more natural way: every time a measurement takes place and the possible 
measurement outcomes decohere, then the future boundary condition simply 
selects one out of many possible outcomes [35j [32] . It also implies a kind of 
"teleology" which might prove fruitful in addressing the anthropic and fine- 
tuning issues [77] The possibility of a final boundary condition on the universe 
could be probed experimentally by searching for "quantum miracles" on a 
cosmological scale. While a "classical miracle" is a rare event that can be 
explained by a very unusual initial boundary-condition, "Quantum Miracles" 
are those events which cannot naturally be explained through any special ini- 
tial boundary- condition, only through initial-and-final boundary-conditions. 
By way of example, destiny-post-selection could be used to create the right 
dark energy or the right negative pressure (etc [81]). 
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5 Discussion of big questions and major un- 
knowns concerning time- symmetry 

5.1 Why God Plays Dice 

Why does uncertainty seem to play such a fundamental role in QM? First 
of all, uncertainty is necessary to obtain non-trivial pre-and-post-selections. 
In addition, this uncertainty is needed in the measuring-device to preserve 
causality. These two uncertainties work together perfectly [32] . Returning to 
the example discussed in §'s 11.2.11 ll.2.3[ I2.2.2[ since the weak- measurement 
result a/2 was "obtained" at a time arbitrarily earlier than the post-selection 
time, couldn't we then ascertain that a future post-selection should produce 
o y = +1, seemingly in violation of causality? While the weak- value depended 
on the post-selection, we now show that this cannot violate causality because 
the uncertainty in the measuring-device forces us to interpret the outcomes 
of weak-measurements as errors. If this were not true, then the outcome of 
a weak-measurement would force us to perform a particular post-selection 
(seemingly in violation of our free-will). In summary, eccentric- weak- values 
like this cannot be discerned with certainty from the statistics of pre-selected- 
only-ensembles for two principle reasons: 

• Analyticity of the measuring-device: Any measurement produces 
only bounded changes in the pointer variable 24 ! which can produce an 
"erroneous" value. The disturbance to the wavefunction of the system 
being measured is bounded only if we prepare the measuring-device in 
an initial state with Q bounded, i.e. &f* D (Q) has compact support. 
But this implies that the Fourier transform of i.e. $f£ D (P) 
is analytic. Therefore, there is a non-zero probability that the pointer 
produces "erroneous" values even from the initial state $^ D (P). That 
is, it must be possible to constructively produce interference in the 
tails of $f r (' D (P) in order to reconstruct the initial wavefunction of 
the measuring- device in the "forbidden" region, i.e. $^°(P — (A) w ) 
centered around A w , just as occurred with super-oscillations. 

• The probability to obtain the weak-value as an error of the 
measuring-device is greater than the probability of obtaining 



24 A bounded change in position, for example, occurs in the weak-measurements dis- 
cussed in Sj2j 
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an actual weak-value. This follows from the requirement that the 
uncertainty in P must be of the same order as the maximum separa- 
tion between the eigenvalues (figure Ob), so that superposition of the 
measuring-device wavefunction can destructively interfere in the region 
where the normal spectrum is defined@ 

Therefore we conclude that the weak-value structure is completely hidden if 
we are looking at a pre-selected-only system because the measuring-device al- 
ways hides the weak-value structure. If the spread in P did not hide the com- 
ponents of A w , then we could obtain some information about the choice of the 
post-selection, which could violate causality. Nevertheless, usually one says 
that a causal connection between events exists if the existence of a single event 
is "followed" by many other events, i.e. that there is a one-way correlation. 
If we consider a number N of weak- measurements during t G [ti n ,tfi n ], then 
when the correct post-selection is obtained, then this post-selection forces all 

the weak-measurements to be centered on = A 2 W = ... = A 1 ^ = ■ 
Therefore, the one-way correlation between f in | and A w is consistent with 
this "causality" condition. Finally, we have also used these considerations 
to probe the axiomatic structure of QM [321 [731 EH H3]- Traditionally, the 



25 As shown in §2.11 and [33] . there are several regimes for valid weak- measurement 
and each rigorously preserves causality: 1) we must have a small system-measuring-device 
interaction strength, A, as compared to strong- measurements in which the accuracy is 
increased by increasing A; 2) a very rare pre-and-post-selection; 3) with robust-wcak- 
measurcmcnt approach [33] , causality is again preserved because the corrections can only 
be made using the relative coordinates which can only be obtained after the particles 
go through the pre-and-post-selections. If one attempted to utilize very eccentric-weak- 
values, then we note that as the weak- value goes further and further outside the operator 
spectrum, |(\&fi n \ j ^in)| 2 from eq. 12.141 (the probability to see this particular weak- value) 
becomes smaller and smaller, and therefore the probability of obtaining these weak-values 
becomes smaller and smaller. As the fluctuation in the system increases, the probability 
of a rare or eccentric post-selection also increases. An attempt to discern this fluctuation 
through the use of weak-measurements requires the spread in the measuring-device to 
be increased. This increases the probability of seeing strange result as an error of the 
measuring-device. This is a general condition which protects causality: the probability 
of obtaining the weak-value as an error of the measuring-device must be greater than 
the probability of post-selection. In other words, restricting Q to a finite interval forces 
$(P) to be analytic which means that <&(P) has tails. The tails allow the exponential to 
be expanded (eq. 12 . 16[) and therefore, the measuring-device will register the weak-value- 
again without changing the shape of $(P)- The existence of these tails means that if the 
measuring-device registers v^, then it is more likely to be an error than a valid weak- value. 
This prevents any "acausal" indicator of the post-selection process. 
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uncertainty of QM meant that nature is capricious, i.e. "God playing dice." 
A different meaning for uncertainty can be obtained [32] from two axioms: 
1) the future is relevant to the present and 2) causality is maintained. In 
this program, uncertainty is derived as a consequence of the consistency be- 
tween causality and weak-values; in order to enrich nature with temporal 
non-locality, and yet preserve cause-effect relations, we must have uncer- 
tainty. 

5.2 The Problem of Free- Will 

The "destiny-generalization" of QM inspired by TSQM ( §4.21) posits that 
what happens in the present is a superposition of effects, with equal contri- 
bution from past and future events. At first blush, it appears that perhaps 
we, at the present, are not free to decide in our own mind what our future 
steps may be^l. Nevertheless, we have shown [32] that freedom-of-will and 
destiny can "peacefully co-exist" in a way consistent with the aphorism "All 
is foreseen, yet choice is given" J7SJ [7B] . 

The concept of free-will is mainly that the past may define the future, 
yet after this future effect takes place, i.e. after it becomes past, then it 
cannot be changed: we are free from the past, but, in this picture, we are 
not necessarily free from the future. Therefore, not knowing the future is a 
crucial requirement for the existence of free-will. In other words, the destiny- 
vector cannot be used to inform us in the present of the result of our future 
free choices. 

We have also shown [32] that free-will does not necessarily mean that 
nobody can in principle know what the future will be because any attempt to 
communicate such knowledge will make the memory system unstable, thereby 
allowing the freedom to change the future. Suppose there is a person who 
can see into the future, a prophet. Then while we, at the present are making 
a decision, and have not yet decided, the prophet knows exactly what this 
decision will be. At this point, as long as this prophet does not tell us what 
our decision will be, we are still free to make it, since we know that if the 
prophet had told us what our decision was going to be, then we would be 
free to change it and his prophecy would no longer be true. Therefore, the 
prophet could be accurate as long as he doesn't tell us our future decision. 



This was Bell's main concern with retrodictive solutions to Bell's theorem. 
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I.e., we are still free to make decisions based on nothing but the past and 
our own mind. Our decisions stand alone and the prophet's knowledge does 
not affect our free-will. 

From TSQM and the destiny-generalization, we may say that this prophet 
is the information of future measurements propagating back from the future 
to affect the results of measurements conducted at the present. Since a mea- 
surement of a weak-value is dependent upon a certain type of post-selection 
(which is only one of a few possible post-selections), but we at the present do 
not know whether the weak-value measured is due to an experimental error, 
or due to the post-selection. In addition, because a weak-measurement could 
be an error, there is nothing that forces us to perform a particular post- 
selection in the future. Only in the future, when all the measurements are 
finished and we actually make the post-selection, can we retrospectively con- 
clude whether the eccentric-weak- value shown by the measuring-device was 
either an error, or a real result due to the concrete post-selection. Again, 
the conditions for a weak-measurements require a high probability of exper- 
imental error. 

From this we conclude that our prophet, the post-selected vector coming 
from the future, does not tell us the information we need to violate our free- 
will, and we are still free to decide what kind of future measurements to 
conduct. Therefore, free-will survives. 

5.3 Emergence and Origin of Laws 

TSQM also provides novel perspectives on several other themes explored in 
this volume, e.g. on the question of emergence |24j : 

• Contextuality ( §2.3[) suggests that the measuring-device determines the 
sets of possible micro-states [69], [701 [32] . 

• A crucial component of contextuality, namely the failure of the product 
rule^l suggests other novel forms of emergence [32]. By way of example, 
another surprising pre-and-post-selection effect is the ability to sepa- 
rate a system from its properties [321 EES], as suggested by the Cheshire 
cat story: "Well! I've often seen a cat without a grin," thought Alice; 
"but a grin without a cat! Its the most curious thing I ever saw in all 



27 The weak-value of a product of observables is not equal to the product of their weak- 
values. 
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IT) = 



II) - 




Figure 22: Chesire Cat grin states. From [13] 

my life!" [79]. We approximate the cat by a single particle with grin 
states given by \a z = +1) (grinning) and \a z = — 1) (frowning). Besides 
spin, we also specify the particle's location as either in a box on the left 
or a box on the right (figure [22]) . Consider the pre-selection: 
|*m> = M {\e z = +1) + Wz = -1)} + \<I>r)\<Tz = +1) and the post- 
selected state: \V fin ) = {\ip L ) - \iJ>r))} {\<r z = +1) - \a z = -1)}. Us- 
ing the isomorphism between spin states and boxes, if Nl(+1) is the 
number of a z — +1 particles in the left box (etc.), then the total num- 
ber of particles in the left box is: Nl(+1) + A^(— 1) = 0. But the 
magnetic moment in the left box is: A^(+l) — Nl(—1) = 2N. Thus, 
there are no particles in the left box, yet there is twice the magnetic 
field there! Alice would say "Curiouser and curiouser": the particles 
are all in the right box, but there is no field there, thereby challenging 
the notion that all properties "sit" on the particle. 

• Finally, the "destiny-vector" ( §4.21) suggests a form of top-down causal- 
ity which is stable to fluctuations because post-selections are performed 
on the entire Universe and by definition no fluctuation exists outside 
the Universe. 

These are examples of emergence with respect to properties. As Barrows 
and Davies [77] have emphasized, the questions of fine-tuning, the origin 
of the physical laws, and the anthropic principle are significant outstanding 
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problems in physics. What novel perspectives can be gleaned from TSQM on 
these questions? E.g. the dynamics-kinematics generalization ( §4.11 [Ml 132]) 
suggests a novel way to think about dynamical laws. One implication is 
the fact that although we may know the dynamics on a particular time- 
scale T, this doesn't mean that we know anything about the dynamics on 
a smaller time-scale: consider a superposition of unitary evolutions (using 
e- !ffT = {e^F} N ): 

J g(v)e~ iH{u)t dv -> J g(u){l + iH(v)t}dv 1 + g{y)E{y)tdv 

if J giy)dv=\ 

(5.47) 

This theory is the same as the usual theory but with an effective Hamiltonian 

H eff = J g{y)H{y)tdv (5.48) 
The finer grained Hamiltonian can be expressed as a superposition of evo- 

-iHT -if3 n HT 

lutions e n = ct n e ^ , i.e. the Hamiltonian can be represented as a 
superposition of different laws given by pre-and-post-selection [32] . 
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